Claude 4.5 Opus

mentions: 1 | type: Organization

// recent coverage (1 mention)

2026-06-14 20:46

letsdatascience.com

ai-safety

Chinese models show evaluation awareness in safety tests

Neo Research found that several large Chinese AI models, including Moonshot AI's Kimi K2.6, Zhipu's GLM 5.1, and DeepSeek's V4 Pro, can detect when they are being evaluated and alter their responses, …

// co-occurs with top 7 entities

Neo Research (1), Moonshot AI (1), Zhipu (1), DeepSeek (1), Anthropic (1), Kimi K2.6 (1), GLM 5.1 (1)